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ABSTRACT 



PL, 



We present Hubble Space Telescope near-infrared transmission spectroscopy of the 
transiting exoplanet HD 189733b, using Wide Field Camera 3. This consists of time- 
series spectra of two transits, used to measure the wavelength dependence of the 
planetary radius. These observations aim to test whether the Rayleigh scattering haze 
detected at optical wavelengths extends into the near-infrared, or if it becomes trans- 
parent leaving molecular features to dominate the transmission spectrum. Due to sat- 
q uration and non-linearity affecting the brightest (central) pixels of the spectrum, light 

5^ curves were extracted from the blue and red ends of the spectra only, corresponding to 

wavelength ranges of 1.099 — 1.168 pm and 1.521 — 1.693 pm, respectively, for the first 
Ci visit, and 1.082— 1.128 pm and 1.514— 1.671 pm for the second. The light curves were 

fitted using a Gaussian process model to account for instrumental systematics whilst 
, simultaneously fitting for the transit parameters. This gives values of the planet-to-star 

radius ratio for the blue and red light curves of 0.15650±0.00048 and 0.15634±0.00032, 
^ respectively, for visit one and 0.15716±0.00078 and 0.15630±0.00037 for visit 2 (using 

f — , a quadratic limb darkening law). The planet-to-star radius ratios measured in both 

visits are consistent, and we see no evidence for the drop in absorption expected if the 
VsO haze that is observed in the optical becomes transparent in the infrared. This tenta- 

tively suggests that the haze dominates the transmission spectrum of HD 189733b into 
near-infrared wavelengths, although more robust observations are required to provide 
conclusive evidence. 

Key words: methods: data analysis, stars: individual (HD 189733), planetary sys- 
tems, techniques: spectroscopic, techniques: Gaussian Processes 
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1 INTRODUCTION Transiting planets also provide an opportunity to probe 

atmospheric structure and composition, through transmis- 
The study of transiting planets is revolutionising our un- . _ . . 

. , sion and emission (eclipse) spectroscopy, transmission spec- 

derstandmg of planets beyond our Solar System. Transit- . , r,i i , , 1- 

, troscopy is a measurement ol the planet-to-star radius ratio 

mg planets are those which pass between us and their host r ,. c , ,, . .. . . , . . . . 

as a function of wavefength. As the opticaf depth m the at- 

stars, periodically blocking starlight and producing a char- n . , , , . ,, .... , 

J mosphcrc is wavelength dependent, so too is the altitude at 

acteristic light curve. These are vital to studies of planetary , . , , , , ,. , , ... , 

which the pfanet becomes opaque to starlight. Wavelength 
structure, as the radius of a planet and its orbitaf inclination , , , r , ,. 

dependent measurements of the pfanet-to-star radius ratio 
can be measured from the transit, and coupled with radial „ ... , , n . . 

are therefore sensitive to atomic and molecular species m 

velocity data the mass, bulk density and structure may be ,, , , 5—= — i „»»» i rp; i nn r^ k 

. . , J J the atmosphere (e.g. Scagcr & Sasselov 2000 Brown 2001) 

inferred. 



Observations with the Hubble Space Telescope (HST) 
and Spitzer Space Telescope have provided the most de- 
tailed observations of exoplanet atmospheres to date (see 
* E-mail: Neale.Gibson@astro.ox.ac. uk e.g. |Charbonneau et al.|[20CJ2| |Vidal-Madjar et al.|[2003| 
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Charbonneau et al.|2005 Deming et al.|2005 l. In particular, 
HD 189733b is one of the most studied exoplanets, being one 
of the brightest known transiting systems. At optical wave- 
lengths its transmission spectrum is largely featureless, dom- 
inated by Rayleigh scattering from a high altitude haze layer 
from ~0. 3-1. Opm first detected by Pont et al. (20081 us- 



by Berta et al. (2011 1 to observe the transmission spectrum 



ing the ACS instrument, and recently confirmed using STIS 
(Sing et al. 20111. The haze is thought to consist of con- 



densate particles; the most likely candidate being MgSi03 
grains (Lecavelier Des Etangs ct al. 2008a bl. However, Na 



has been detected using higher resolution observations ( Red 
field et al.|2008||Huitson|2011[ ). Transits at UV wavelengths 



have shown the planet to have an escaping hydrogen atmo- 
sphere (Lecavelier Des Etangs et al. 20101. The situation 



in the near-infrared is more debated; whilst the transmis- 
sion spectrum obtained with NICMOS from 1.4-2.5 pm ap- 



parently showed significant H2O and CH4 features (Swain 



et all 2008 1, narrow band photometry with NICMOS (Sing 



et al. 12009 ) proved to be inconsistent with this yet consistent 



with extrapolation of the optical haze. Spitzer observations 
at 3.6pm are again consistent with the haze, although longer 
wavelength observations suggest the presence of molecular 
absorption ( |Desert et al.|2009|[20TT |. 

The interpretation of molecular features in the NIC- 
MOS transmission spectroscopy requires the haze to become 
largely transparent somewhere between ~ 1-1.4 pm, and the 
transmission needs to show significant temporal variation 
for the spectroscopic and narrow-band observations to be 
consistent. A more likely explanation is that the systemat- 
ics in the NICMOS transmission spectroscopy dataset were 



not fully accounted for as suggested by Sing et al. (20091 



In Gibson et al. (2011a I we reanalysed the NICMOS trans- 



mission spectroscopy, and claimed that standard methods to 
model instrumental systematic based on linear basis func- 
tions were not reliable in this case, and that the detections 
of molecular species relied on unjustified assumptions about 
the form of the systematics model. Therefore the uncertain- 
ties are largely underestimated and not precise enough to 
probe for the presence of molecular species. This interpreta- 
tion was largely confirmed in Gibson et al. (2011b I, where 



we developed a Gaussian process (GP) model to extract the 
transmission spectrum robustly in the presence of such sys- 
tematics, although it is important to note that this view is 
still disputed by the original authors ( Swain et al|20l"T l. 

GPs allow the use of non-parametric models for the in- 
strumental systematics, and infers the transit parameters 
in a fully Bayesian framework. Therefore fewer assumptions 
are made about the form of the systematics model, providing 
more robust transmission spectra. The NICMOS transmis- 
sion spectrum of HD 189733b inferred using GPs does not 
exclude the presence of molecular species beyond ~1.5pm; 
however, if they are present, we cannot constrain them very 
well with current data. Either way, the question remains of 
how far the haze observed in the optical extends into the 
infrared. Observations bridging the gap between the optical 
and NICMOS data should help resolve the situation. 

Here we present observations designed for that purpose; 
HST transmission spectroscopy of HD 189733 using the 
Wide-Field Camera 3 (WFC3) from 1.08 - 1.69 pm. These 
observations were originally planned with NICMOS as part 
of program GP 11740 (P.I. F. Pont), before the failure of the 
NICMOS cooling system to restart. WFC3 was recently used 



of G,I1214b, resulting in near-photon limited observations, 
and showing the WFC3 will prove a powerful instrument for 
such measurements. However, here we are using the WFC3 
in a noticeably different regime to observe a much brighter 
star. Sect. |2] describes the observations and data reduction, 
and Sect. [3]describes the analysis of the light curves. Finally 
Sects. [4] and [5] present our results and conclusions. 



2 WFC3 OBSERVATIONS AND DATA 
REDUCTION 

Two transits of HD 189733b were observed with the IR chan- 
nel of HST/WFC3 on 2010 September 4 and 2010 November 
10, as part of program GP 11740 (P.I. F. Pont). Observations 
used the G141 grism, and we obtained slitless spectra from 
~ 1.08 — 1.69 pm with a resolution of A/AA ~ 130 at the 
central wavelength. For each visit, 592 spectra were taken, 
preceded by an image taken with the F167N filter in order to 
set the wavelength calibration. As HD 189733 is not in the 
continuous viewing zone of HST, each transit was observed 
over 4 half-orbits (~48 minute blocks). For both visits, the 
first, second and fourth orbits cover the out-of-transit part 
of the light curve, and consist of 142, 150 and 150 spectra, 
respectively, and the third orbit covers the in-transit part of 
the observation, consisting of 150 spectra. 

As HD 189733 is a particularly bright star (V ~ 
7.67, K ~ 5.54), we used exposure times of only 0.225 sec- 
ondfB These were taken in multiaccum mode with 2 non- 
destructive reads (rapid, nsamp=2). This means that the 
detector is read out three times per exposure, once at the 
start, once mid- way, and once at the end of the exposure. 
Hereafter we refer to these images as the zeroth, first and sec- 
ond readouts. The overheads for bright targets with WFC3 
are dominated by reading out the detector, and dumping 
the temporary storage buffer of WFC3 to the HST solid sate 
recorder. Therefore the camera was used in SUBARRAY mode, 
reading out only the central 128x128 pixels (IRSUB128), 
which contains most (but not all) of the first order spectrum, 
but excludes the zeroth order image and the second order 
spectrum. This reduced the readout time and increased the 
interval between buffer dumps, eliminating the need for a 
long buffer dump during each orbit. We used the shortest 
possible exposure time in this readout mode. This resulted 
in a cadence of ~18 seconds, the minimum allowed by the 
WFC3 electronics. 

The data were reduced using the latest version of 
the CALWF3 data-pipeline (v2.3). This removes bias-drifts, 
subtracts the zeroth readout from each subsequent non- 
destructive read, subtracts dark images, and corrects each 
pixel for non-linearity. As the WFC3 has no shutter, as many 
as ~ 60 000 e~ /pixel were recorded in the zeroth readout im- 
age. These are accumulated between the final reset of the 
detector before an exposure (the detector is continuously 



I It is not possible to defocus WFC3, and at the time of phase 

II preparation, 'driftscan' mode (McCullough &; MacKcnt y|201l| 
was not available. This involves slowly moving the telescope dur- 
ing exposures to smear out the cross-dispersion PSF, therefore 
avoiding saturation of the brightest pixels and allowing longer 
exposures. 
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reset when not recording photons or being read out) and 
the time of zeroth read, and may also be influenced by per- 
sistence effects. The detector exhibits significant non-linear 
behaviour, which is correctable at low to intermediate lev- 
els, and takes the zeroth read counts into account. At high 
levels, > 78 000 e~ /pixel, th e response bec omes highly non- 
linear and not correctable ( |Dressel|[20TT| P| Unfortunately, 
for both the first and second readouts, and for both vis- 
its, a strip of pixels is saturated along the central region of 
the spectra. Fig. [l] shows a plot of the raw images for one 
spectra, with the zeroth, first and second reads shown, after 
subtraction of the bias levels imposed during each detector 
resei|^] It therefore shows the raw electron counts per pixel, 
that contribute to non-linearity. The non-linearity limit of 
> 78 000 e~ /pixel is surpassed by a significant region in the 
centre of the first readout, and most of the second readout, 
therefore these regions cannot provide useful transit light 
curves. We tried to extract useful data from the zeroth read 
spectra, but they resulted in unphysical transit depths, pre- 
sumably related to persistence effects in the detector. Sub- 
tracting the zeroth read is important to remove this effect 
from the subsequent reads. The pipeline does not apply the 
non-linearity correction to the zeroth read prior to this. This 
is a potential source of error in using the current pipeline, 
but the recorded counts in the zeroth read are within the lin- 
ear regime (at least in the regions of the spectrum we finally 
use), so we assume this effect is negligible on our final light 
curves. The pipeline usually only applies the non-linearity 
correction to non-saturated pixels; nonetheless we forced the 
pipeline to apply the correction to all pixels, and proceeded 
to extract spectra for both first and second readouts. 

To extract the spectra, we used a technique similar to 



that used in |Gibson etaL] ( |2011a[ ) for tfST/NICMOS spec- 
troscopy, using a custom pipeline written in python. The 
spectral trace was first determined for each image by fit- 
ting a Gaussian profile in the cross-dispersion axis (y) for 
each of 128 columns along the dispersion direction (x). A 
straight line was fitted to the centres of the profiles to de- 
fine the spectral trace. A sum of 10 pixels was taken along 
the cross-dispersion direction centred on the spectral trace 
to determine the flux for each pixel channel, after subtract- 
ing a background value. The extraction width was chosen 
to minimise the out-of-transit RMS after corrections for 
instrumental systematics were applied (see Sect. [3|. The 
final light curves were not particularly sensitive to vary- 
ing the extraction width between ~7-15 pixels. The back- 
ground was determined from a region on the same pixel col- 
umn above the spectrum, but we also tested a global back- 
ground correction. The final light curves were not sensitive 
to the choice of background region, given the low background 
counts (~35 e~ /pixel) and lack of spatial variation. Fig. [2] 
shows spectra extracted for both runs and for both the first 
and second readouts. The dashed line is first order spectra 
scaled to match the exposure time of the second readout. 
The saturated region is obvious in the second readout, but 



2 This is defined as where the detector counts deviate by more 
than 5% from linearity. 

3 The bias level does not result from physical charges in each 
pixel therefore does not contribute to the detector non-linearity. 




Figure 1. Wireframe plot of the raw images that make up a typi- 
cal exposure, with the zeroth (red), first (green) and second (blue) 
reads shown. The bias levels imposed during each detector reset 
have been subtracted, therefore showing the raw electron counts 
per pixel that contribute to non-linearity. The non-linearity limit 
°f ?J 78 000 e~ /pixel is surpassed by a significant region in the 
centre of the first readout, and most of the second readout, there- 
fore these regions cannot provide useful transit light curves. 



over much of the wavelength range in both the first and 
second readout the flux is non-linear. 

Transits for each pixel channel were then constructed 
from time series of the spectra. The transits suffer from in- 
strumental systematic noise, which is typical of HST obser- 
vations of transit light curves (e.g. ACS: 



Pont et al. 2008 



NICMOS:|Swain et al. 1201381 |Gibson et al.|2011a| STIS: |Sing| 



et al.||2011[ ). These may arise due to thermal breathing and 
temperature variations, plus pointing drifts which aggravate 
pixel-to-pixel sensitivity variations. Transits in the saturated 
regions of the spectra predictably showed shallower than ex- 
pected transit depths (often unphysical). We explored var- 
ious methods to extract useful information from the satu- 
rated regions of the spectra. Indeed, this is why we chose 
to use a custom pipeline rather than standard extraction 
tools, as it allows much more control over the data reduc- 
tion process, and the benefits of more sophisticated meth- 
ods such as optimal extraction over aperture extraction are 
marginal for high S/N data. One such method was to simply 
mask the saturated pixels (the same pixels in every image), 
and sum over the remaining pixels. As the FWHM in the 
cross-dispersion direction is rather narrow (~ 2 pixels), the 
saturated pixel contained much of the signal. More impor- 
tantly, the systematics resultant from pointing drifts were 
greatly enhanced when the central pixel is masked, as small 
variations in position cause a large change in the response. 
After further tests, reluctantly we concluded that useful in- 
formation could not be obtained where the central pixel was 
saturated, and decided to extract two light curves at either 
end of the wavelength range in the first readout spectra, 
within the unsaturated regions. 

For the first visit, we summed the first 15 pixel chan- 
nels, and final 37 pixel channels to form what we will refer 
to as 'blue' and 'red' channels, respectively. Similarly, for 
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20 40 60 80 100 120 140 
x pixel 




20 40 60 80 100 120 140 
x pixel 

Figure 2. Example spectra extracted for one image for the first 
(top) and second (bottom) visits, showing spectra for both the 
first and the second readouts. The dashed green line is the first 
readout scaled to the exposure time of the second. Clearly the 
spectra of the second readout are saturated, but much of the 
wavelength range in both the first and second readout is non- 
linear and not correctable to a scientifically useful level (see text). 
We therefore only extracted light curves from either end of the 
wavelength range. The blue and red dashed-dotted lines show the 
extraction regions used for each visit. Note that the zeroth read is 
subtracted prior to extracting these spectra, and they are summed 
along the y-axis, hence do not correspond in a simple way to the 
detector saturation limit. 



the second visit we extracted the first 11 and final 35 pixel 
channels into the blue and red channels. These extraction re- 
gions are marked in Fig. [2] Fig. [3] shows the extracted blue 
and red channel light curves for the first and second visits. 
The wavelength dispersion of the WFC3/IR G141 grism is 
position dependent, and the position of HD 189733 in the 
direct image was used to calibrate the wavelengths. The blue 
and red channels in the first visit correspond to wavelength 
ranges of 1.099—1.168 um and 1.521 — 1.693 um, respectively, 
and 1.082— 1.128 (im and 1.514— 1.671 um, respectively, for 
the second transit. 

Instrumental systematics are evident in the light curves, 
and must be accounted for to measure the planet-to-star ra- 
dius ratio. We therefore extracted auxiliary parameters from 



the data in a similar way to Gibson et al. (2011a I in order 



to explore systematics models. These were the shift in x- 
position (AX), y-position (AY), and average width (W) 
and angle (ip) of the spectral trace. AX was determined 
from cross-correlation of the extracted spectra in each im- 



age (first readout), Ay and ip from the straight line fits 
to the spectral trace, and W from the average FWHM of 
the Gaussian profiles fitted to each extraction column in the 
cross-dispersion direction. The phase (6hst) of HST was 
also calculated for each time. Unfortunately no useful tem- 
perature information was available in the image headers. 
Berta et al. (20111 provide a detailed description of WFC3 



systematics for transmission spectroscopy, which we do not 
attempt to reproduce here. However, we note that the form 
of the systematics in our light curves are slightly different 
given the different observing strategies used; most notewor- 
thy we do not see a prominent 'ramp' effect. This is likely 
because we do not require buffer dumps during orbits, which 
leads to regular cadence, therefore stable persistence effects 
for each exposure. 



3 ANALYSIS 

3.1 Gaussian process model 

The light curves were modelled using the GP model de- 
scribed inlGibson et al. ( 2011b I , which we briefly recap here. 



Each light curve is modelled as a GP with a transit mean 
function: 

f(t,x)~gv (T(t,<j>),Ji(x,e)), 

where / is the flux, t the time, x the vector containing 
K auxiliary measurements, T is the transit (mean) func- 
tion with transit (hyper-)parameter vector <f> (including the 
planet-to-star radius ratio, p), modelled using the analytic 
equations of |Mandel fc Agol ( 2002]) , and £ is the covari- 
ance matrix with hyperparameter vector 0. The covariance 
between two points x n and x m is given by the squared ex- 
ponential kernel function: 



k(x n , x m ) = £ exp - ^2 rji 



where £ is a hyperparameter that specifies the maximum co- 
variance, r/i are the inverse scale hyperparameters for each 
auxiliary measurement vector, a 2 is the variance due to 
white noise, and S„ m is the Kronecker delta. The kernel 
accounts for the instrumental systematics, and this one de- 
scribes a smooth function of the input parameters, with the 
addition of white noise. In other words, the instrumental 
systematics model assumes a similar value when all relevant 
auxiliary measurements are close, with the 'closeness' spec- 
ified by the inverse length scales. Similarly to other HST 
analyses, we excluded orbit 1 from the fits. This has the 
added advantage of significantly speeding up inversion of 
the covariance matrix and therefore inference of the planet- 
to-star radius ratio for each light curve. 

The definition of a GP states that the likelihood is mul- 
tivariate Gaussian. We multiply the likelihood by gamma 
hyperpriors of shape parameter unity for £ and the rji hy- 
perparameters to form the posterior distribution (implying 
uniform, improper priors for the remaining parameters). The 
scale length of the hyperpriors were set to large values so 
they have little influence over the inference, other than to 
ensure the hyperparameters remain positive. The posterior 
distribution of the planet-to-star radius ratio (p) is found by 
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Figure 3. Light curves for the blue (left) and red (right) channels. Top: Raw extracted light curves, showing systematics that need 
accounting for when inferring the planet-to-star radius ratio. Middle: 'Cleaned' light curves after subtracting the Gaussian process 
systematics model. Bottom: Combined light curves from both visits. The cleaned light curves are shown for illustrative purposes only; 
the planet-to-star radius ratio is inferred simultaneously with the GP systematics model. The first orbit was excluded for the inference, 
but the projected GP model is subtracted nonetheless. 



marginalising over all the other transit parameters and co- 
variance hyperparameters using Markov- Chain Monte-Carlo 
(MCMC), where the fixed transit parameter^] are set to the 
values of Pont et al. (2007, 20 08j) ex cept the ephemeris which 
is calculated from the Agol et al. (20101 value^] The light 
curve function is multiplied by a normalisation (baseline) 
function during fitting, in this case a linear function of time, 
whose parameters we also marginalise over in the MCMC. In 
all, we fit for the planet-to-star radius ratio, the parameters 
of the normalisation function, and all the GP hyperparam- 
eters including the white noise. For each light curve we ran 
four chains and tested for convergence using the Gelman 
& Rubin (GR) statistic ( |Gelman fc Rubin|1992) . For more 
details on the GP model and MCMC routine, see |Gibson| 



in Sing et al. (20111. This was to enable easy comparison 



et al. (201 lb I. The limb-darkening parameters were calcu- 



lated for each light curve using the methods described in 



Sing (20101 for both quadratic and non-linear limb darken- 



ing laws. These were calculated using the ID stellar atmo- 
sphere models rather than the 3D model atmospheres used 



4 We fix the scale length a/R* and inclination i; these param- 
eters are correlated with the planet-to-star radius ratio, but in 
transmission spectroscopy we are interested in the posterior dis- 
tribution of p conditioned on these parameters, i.e. the relative 
values for p. 

5 The more up to date physical parameters of |Agol et al.| j2010 1 
were not used here to enable direct comparison with NICMOS 
and ACS data. 



with ACS and NICMOS transmission spectroscopy, but fu- 
ture studies will use the more reliable 3D limb darkening 
parameters. Inference of the planet-to-star radius ratio was 
performed for both laws holding the limb darkening param- 
eters fixed. 

We decided to take a different approach than for the 
NICMOS data; rather than use all of the auxiliary informa- 
tion, we were careful to use only the inputs that are needed 
to model the shape of the systematics per orbit, but impor- 
tantly do not change the flux offsets between orbits. There- 
fore we first tried using only the orbital phase of HST (9hst) 
as input to the GP. This did not provide a satisfactory cor- 
rection for all orbits, particularly for orbits where a discon- 
tinuity is evident mid way through each orbit in the light 
curve^] We therefore included the cross-dispersion width 
(W) of the spectrum as an additional GP input. Plots of 
Oh st and W for both visits are shown in Fig. [4] These were 
normalised to zero mean and variance of 1 prior to their 
use as GP inputs to ensure the hyperpriors provide similar 
constraints. This GP model provided a satisfactory correc- 
tion for the systematics, and the two visits gave consistent 
results. The MCMC chains were well behaved in this case, 
and the GR statistic was within 1% of unity. These are used 
as our final values for p. We also tested this model including 

6 The discontinuity is caused by HST crossing the Earth's 
shadow. 
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Figure 4. Plots of 8hst and W for both visits as a function 
of orbital phase of HD 189733. These are the auxiliary inputs 
used as input vectors for the final GP model of the instrumental 
systematics. 



orbit 1 in the fitting procedure, which did not significantly 
change the results, nor did using (GP) smoothed versions of 
the W auxiliary parameter. The light curves produced after 
subtracting the GP systematics models are shown in Fig. [3] 
We further tested the GP model using AX, Ay and ip 
as additional GP inputs. The MCMC did not converge in 
this case, or give consistent results for the two runs. These 
additional inputs show similar structure to the W input but 
are much noisier. This causes degeneracy in the GP hyper- 
parameters thus demanding much longer MCMC chains to 
achieve convergence. However, we also ran tests using GP 



smoothed input parameters (see Gibson et al.pOllb I, and 
the MCMC now converged, showing that the level of noise 
on the inputs can also significantly affect convergence. This 
led to consistent values for p when compared to using only 
Ohst and W as inputs, although with larger uncertainties. 
Given that these additional inputs are particularly noisy, 
and more importantly that they show (small) offsets be- 
tween orbits, we chose not to use these parameters in our 
final results, as this can cause flux offsets to be fitted by 
the GP. For ip, the in-transit values are also outside the 
range covered by the out-of-transit data, hence requiring 
an extrapolation to the in-transit data. In theory, these is- 
sues might be addressed with appropriate prior widths in the 
gamma hyperpriors, but this becomes rather subjective with 
no robust way to set them. Of course, in an ideal Bayesian 
framework we should be able to include all the (noisy) in- 
puts. More sophisticated tools to evaluate the posterior dis- 
tribution and marginalise over the hyperparameters such as 
Bayesian Quadrature methods (e.g. O'Hagan 1991) might 
enable this in the future. 

Our approach of using Ohst and W as the only inputs is 
supported by two further pieces of evidence. The first is that 
the intrapixel sensitivity variations are much smaller than 
for NICMOS, with no measurable variation found during 
ground-testing for WFC3, compared with ~ 40% for NIC- 
MOS (Dressel 201 1| ; therefore the variations in flux caused 



tween orbits to fully account for the systematics. Indeed the 
'divide-oof method used should provide a similar correction 
to the GP model using Oh and W as inputs, given that they 
show regular repeating structure for each orbit. 

3.2 Linear basis function model 

As an independent check we also tried using the 'standard' 
method to model the system atics (see e.g. |Brown et al 



200l||Gilliland fc Arribas|2003|[Pont et al.|2007||Swain et aT. 



20081, i.e. using the extracted auxiliary parameters as basis 
functions, and fitting for the coefficients using linear least 
squares (see Gibson et al.|2011a|b for a detailed description 
of the method used here). Using W and Ohst as basis func- 
tions, and higher order squared terms, provided consistent 
results with the GP model (within la), and the residual 
permutation method described in Gibson et al. (2011a) pro- 
vided similar uncertainties. When we added AX, AY and 
ip as additional basis vectors the instrument model became 
unstable and the two visits provided different planet-to-star 
radius ratios for the blue and red transits, validating our use 
of W and Ohst as the only inputs to the GP. 



4 RESULTS 

4.1 WFC3 light curve fits 

The inferred planet-to-star radius ratio for both quadratic 
and non-linear limb darkening laws are given in Tab. [J] 
as both have been used in the literature for HD 189733. 
The uncertainties are calculated from the distribution of the 
MCMC chains, after removing the first 20% of each, and are 
given by the limits which encompass 68.2% of the probability 
distribution. 

The planet-to-star radius ratio obtained with the 
quadratic limb darkening law are plotted in Fig. [5] and 
the dashed line is a weighted mean of all four data points. 
Clearly no planet-to-star radius ratio variation is detected in 
the WFC3 data. A comparison with other data requires cor- 
rections for unocculted star spots, discussed in the following 
section. 



4.2 Spot corrections 

As HD 189733 is an active star, we need to correct the 
transmission spectrum for unocculted star spots to make 
a detailed comparison with data taken at different times 
and wavelengths. Here, we use the spot corrections as de- 
scribed in Sing et al. (20111 in order to place the WFC3 



data in context alongside other transmission spectra of HD 
189733b. However, given the complexity of spot corrections, 
this warrants its own publication, and will form a large part 
of a future paper (Pont et al, in prep), where we will anal- 
yse the complete transmission spectra of HD 189733b from 

7 We have simply given the wavelength range of the extraction 
region; more detailed analyses would need to take into account 
the spectral response, but is not warranted here. 
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Figure 5. Measured planet-to-star radius ratio of HD 189733b 
from WFC3 data using quadratic limb darkening prior to spot cor- 
rections. Visit 1 and 2 are shown by the green and yellow points, 
respectively. The dashed horizontal line marks the weighted av- 
erage of all four points. 



contrast between the spotted and unspotted stellar surface. 
However, these may still influence the extracted transmis- 
sion spectrum. 

Fig. [6 plots the corrected WFC3 data alongside the ACS 



data from 



Sing et al 



Pont ct al. (2008), the NICMOS photometry from 



(20091, and the reanalysis of the NICMOS trans- 



mission spectroscopy from |Gibson et aL] ( |2011b| . The ACS 
data were already corrected for spots using a similar tech- 
nique and were taken directly from the paper; the NICMOS 
spectroscopy and photometry were corrected in the same 
way as the WFC3 data. The WFC3 data are largely con- 
sistent with ACS and NICMOS photometric data, but dis- 
agrees with the red end of the NICMOS spectroscopy. This 
tentatively suggests that the transmission spectrum is dom- 
inated by the haze into the NIR, and the feature at ~1.6um 
may indeed be of instrumental origin ( jGibson et al.|2011b[ ). 
However, given the difficulty in performing spot corrections 
(particularly with the large baseline between the data sets), 
and in extracting the WFC3 data, it is not wise to rule out 
the feature at ~ 1.6 urn from these data alone. 



Table 1. Planet-to-star radius ratio calculated from the GP in- 
ference, given for both limb darkening laws. 



Wavelength ((i.m) 



(quadratic limb 
1.099 - 1.168 
1.082 - 1.128 
1.521 - 1.693 
1.514- 1.671 

(non-linear limb 
1.099 - 1.168 
1.082 - 1.128 
1.521 - 1.693 
1.514 - 1.671 



darkening) 

0.15650 ± 
0.15716 ± 
0.15634 ± 
0.15630 ± 

darkening) 
0.15650 ± 
0.15735 ± 
0.15644 ± 
0.15672 ± 



0.00048 
0.00078 
0.00032 
0.00037 

0.00047 
0.00076 
0.00032 
0.00043 



UV to IR. In addition it is important to model the tran- 
sit light curves with consistent stellar parameters and limb 
darkening laws, which will also be addressed in this paper. 

Correcting for unocculted spots requires long term mon- 
itoring of the stellar flux to obtain the variation in spot 
coverage between visits, and an estimate of the flux of 
the unspotted surface. These were obtained by fitting long- 
baseline ground-based coverage from the T10 0.8 m Auto- 
mated Photoelectric Telescope (APT) at Fairborn Observa- 
tory in southern Arizona, detailed in |Henry fc Winn] ( |2008[ ) 
that covers all the HST visits considered here. The photom- 
etry was fitted using GP regression with the quasi-periodic 
kernel given in Ai grain et al.| ( |2011| ). The spot correction 
was then performed in the same way as described in |Sing| 
et al. (20111, assuming a spot temperature of 4250 K mea- 



sured from STIS and ACS transits. The spot corrections 
shifted the planet-to-star radius ratio for the blue and red 
light curves by —0.0010 and —0.0006, respectively, for visit 
one, and -0.0009 and -0.0005 for visit 2. 

We note that no spot-crossing events are visible in our 
data. It is highly likely that spots were occulted given that 
the optical transits almost always show spot-crossing events 
(e.g. |Pont et aL]|2007| |Sing et aL]|2011| ), rather they are 
not visible in our data given the lower S/N and lower flux 



5 DISCUSSION AND CONCLUSION 

We have presented HST/WFC3 spectroscopic observations 
of the transiting system HD 189733. The aim of this study 
was to bridge the gap between the ACS optical data of |Pont| 
et al.| (|2008 1, near-infrared transmission spectra with NIC- 



MOS (Swain et al.|2008|[Gibson et al.|2011b[ ), and NICMOS 
photometry of |Sing et al.| ( |2009| >. Unfortunately, the WFC3 
spectra were saturated for much of the central region, and 
we could only extract useful light curves at either extreme 
of the G141 grism, corresponding to central wavelengths of 
~1.134 um and 1.607 urn for visit 1 and ~1.105 [im and 1.592 
(xm for visit 2. The red and blue light curves show consistent 
transit depths for both runs when using either quadratic or 
non-linear limb darkening laws. We have not detected a drop 
in the planet-to-star radius ratio one would expect if the op- 
tical haze does indeed become transparent between ~1.1 and 
1.4 um, as has been suggested to explain the different be- 
haviour observed at optical and near-infrared wavelengths. 

Correction for unocculted star spots shows the WFC3 
data are consistent with the AC S data of|Pont et al.| ( |2008[ ) 



and the NICMOS photometry of Sing et al. (2009), but dis- 



agree with the red end of the NICMOS transmission spec- 
troscopy of|Swain et al.| ( |2008| ) and the reanalysis of |Gib- 



son et al. (2011b I. A more detailed analysis of the complete 



transmission spectrum of HD 189733b is in preparation, 
with more detailed spot corrections, consistent limb dark- 
ening laws and stellar parameters. Nonetheless, this work 
tentatively suggests that Rayleigh scattering from the high- 
altitude haze dominates the transmission spectrum up to 
and including near-infrared wavelengths. We note that both 
the W FC3 data and the NICMOS photometry of |Sing et all 
1 2009) have shown consistent transit depths, and have failed 
to produce the deep feature at ~1.6 urn reported in | Swain] 
et al. (20081. However, whilst WFC3 appears better be- 



haved than NICMOS transmission spectroscopy, given that 
the data are near the non-linear range of the detector, and 
the difficulty in performing spot corrections, more evidence 
is required to conclusively resolve the issue and provide con- 
tinuous coverage from optical to near-infrared wavelengths. 
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Figure 6. Transmission spectra of HD 189733 showing the WFC3 data presented in this paper, the ACS data from |Pont e t al. ( 200 8), the 
NICMOS photometry from |Sing et al.| ( |2009| l, and the reanalysis of the NICMOS transmission spectroscopy fro m|Gibson et al.| (|2011b[| 
after correction for unocculted spots (see text). The dashed line is the Rayleigh scattering haze function given in |Lecavelier Pes Etangs] 
|et al.| ( |2008a| ). The WFC3 blue and red light curves show consistent depth, providing no evidence for a 'drop-off' in the haze observed at 
optical wavelengths. The WFC3 data are also consistent with the Rayleigh scattering haze extrapolated into the NIR; however given the 
difficulties extracting the light curves and performing the spot corrections we hesitate to draw firm conclusions from the WFC3 data. 



Future observations of HD 189733 in 'driftscan' mode should 
help provide this evidence. 
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